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@) Reconfigurable ASIC. 

(57) A configurable semi-conductor integrated cir- 
cuit with particular application as a re-con- 
figurable application specific device, in order to 
be able to rapidly switch between two or more, 
preferably several, configurations, the invention 
provides a configurable semi-conductor inte- 
grated circuit in which an area (1) thereof is 
formed with a plurality of cells (2) each having 
at least one function and interconnections with 
at least some other said cells (2). At least some 
of the plurality of cells have interconnections 
(25) which are electrically selectable as to their 
conduction state, and at least some of the 
plurality cells have interconnections (YA-YD) 
which are pre-wired. Each cell has two or more 
possible configurations, each configuration be- 
ing defined by the cell function and/or its inter- 
connection with other cells according to cell 
configuration data, and further comprising 
means (36, 38, 40) storing configuration data for 
at least two cell configurations (per cell) and 
means (30, 32, 34, 42, 48) to enable one of the 
possible cell configurations according to the 
cell configuration data selected. 
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The present invention relates to a configurable in- 
tegrated circuit, with particular mphasis on a re-con- 
figurable application specific device but without lim- 
itation to same. 

Micro-processors are designed into many appli- 
cations because of their low cost and high perfor- 
mance. However, for many applications such as im- 
age compression and digital signal processing they 
are too slow. Modifications to the basic micro-proces- 
sor architecture has led to several new devices, digi- 
tal signal processors (DSP), reduced instruction set 
computers (RISC) and custom processors (CP). Each 
of these devices are optimised to perform a restricted 
number of tasks but at very high speed. Many appli- 
cations require several types of such devices to ach- 
ieve the necessary level of performance. This is be- 
cause of the requirement to perform different types of 
computational tasks over a period of time or the lim- 
ited capability of each device. Essentially these de- 
vices are used as low cost high performance numer- 
ical engines, each optimised to implement a general 
class of algorithms. However, a designer frequently 
r quires a different architecture to efficiently imple- 
ment a new algorithm and the usual practice in such 
circumstances is to design a custom processor forthis 
task. This leads to long and expensive design cycles 
and does not allow the designer any flexibility to 
change the algorithm. 

Field programmable gate arrays (FPGAs) are 
commonly used to replace standard products and 
they could be used as a numerical engine. However, 
they are general purpose devices that cannot effi- 
ciently implement high speed circuits. In order to ach- 
i ve the level of complexity that is normally required, 
s veral FPGAs would be necessary which would in- 
crease the cost of the final system. Some FPGAs are 
configured using on chip static random access mem- 
ory (SRAM) and these devices can be re-program- 
med to perform different tasks which could lead to 
greater flexibility and higher levels of performance. 
However, these devices are connected to an external 
source of configuration data that is accessed by the 
d vice to configure internal resources. The time to 
configure or re-configure the FPGA can be several 
milliseconds, due to the necessity to import configur- 
ation data from an external source, and this time is 
several orders of magnitude too slow. Reconfigura- 
tion speeds of less than 100 nano-seconds are re- 
quired for high performance applications. As such 
FPGA's cannot be reconfigured fast enough to make 
them suitable for use as a high performance numeri- 
cal engine. In FPGAs a considerable amount of silicon 
area is committed to the configuration memory which 
is required to program interconnect resources. Whilst 
in theory FPGAs could accommodate an additional 
configuration by increasing the amount of on chip 
memory which is available to hold configuration data 
this would probably increase the size of the chip by 60 



per cent which would b prohibitiv for high density 
arrays. 

The aim of the invention is to provide a re-config- 
urable architecture which can rapidly switch between 
5 two or more, preferably several, configurations. An- 
other aim of this invention is to provide a device that 
is specifically optimised to carry out functions for nu- 
merically intensive applications. Another aim is to pro- 
vide a device that prior to the application of power 
10 contains one or more boot up primary configurations, 
suitable for configuring the device into the intended 
application. Afurther aim is to produce a device that 
has provision for passing data between successive 
configurations of the (base) device. A still further aim 
15 is to ensure that during configuration of the device, 
data is held in a safe condition and that switching cur- 
rents are minimised. A still further aim is to provide a 
configuration cache that will allow updating of config- 
uration memories that are not currently in use. An- 
20 other aim of the invention is to allow the device to se- 
lect its own configuration from an external source of 
configuration data. 

Another aim is to reduce the number of program- 
mable interconnections by pre-wiring a portion of the 
25 logic into the required configuration. 

A yet further aim is to increase performance of 
the device by pre-arranging specified primary func- 
tions to specific areas of the device such primary 
functions being substantially pre-wired. 
30 Accordingly one aspect of the invention provides 

a configurable semi-conductor integrated circuit in 
which an area thereof is formed with a plurality of cells 
each having at least one function and interconnec- 
tions with at least some other said cells, at least some 
35 of the plurality of cells having intercon nections which 
are electrically selectable ste to their conduction state, 
and at least some of the plurality of cells having inter- 
connections which are pre-wired, each cell has two or 
more possible configurations, each configuration be- 
40 ing defined by the cell function and/or its interconnec- 
tion with other cells according to cell configuration 
data, and further comprising means storing configur- 
ation data for at least two cell configurations (per cell) 
and means to enable one of the possible cell conf ig- 
45 urations according to the cell configuration data se- 
lected. 

By pre-wired in relation to interconnect we mean 
uninterruptable as to its conduction state. The config- 
uration data controls selection of the cell function 

50 and/or cell interconnections preferably using decod- 
ers or alternatively controlled directly from memory. 
Thus for example the cells configuration data deter- 
mines the routing of the signal through the cell. Direct 
conn ction paths exist between the configuration 

55 stores, the decoders and the selectable functions and 
interconnections. The term function as used herein 
may be a logic function, arithmetic function, or inter- 
connect function. A cell may have one or more of 
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these functions or a combination of two or more of 
these. Preferably the configuration data stores ar 
disposed in the cell. The desired configuration is se- 
lected using an instruction bus receiving signals from 
a sequencer and controller. One or more of the con- 
figurations may be p re-wired (ie. not programmable). 
Advantageously one or more of the configuration data 
stores are programmable using a data transfer bus. 
Where more than one store is programmable an in- 
struction update bus is provided to write enable the 
required configuration store. Configuration stores not 
currently accessed to control interconnection and/or 
cell function can be updated using the instruction up- 
date bus. 

Since the present invention is particulary con- 
cerned with an application specific device which is 
optimised to perform a restricted number of tasks at 
high speed but which is quickly reconf igurable during 
program execution (when required) to perform some 
other specific task, cells are optimised for a primary 
function according to a primary configuration. Advan- 
tageously the primary configuration data is pre- wired. 
It is convenient to have two alternate pre-wired pri- 
mary configurations. Cells can be and most usually 
will be optimised for different primary functions. Ad- 
vantageously the pre-wired interconnections are 
used in connection with the optimised functions. 

A_possible primary function is that of an adder. 
Another aspect of the invention provides a multi-bit 
adder for summing at least two multi-bit words com- 
prising a first multi-bit adder block for summing the 
least significant bits and at least one further multi-bit 
adder block for summing the most significant bits and 
having sum selection means wherein said further 
multi-bit adder block calculates the two possible 
sums resulting from a carry out from the previous 
block being equal to '0' and T respectively and 
wherein the sum selection means selects the sum of 
th further multi-bit adder block according to the carry 
out calculated from the previous block. 

In the case of a Digital Signal Processor applica- 
tion, some cells will be optimised as Arithmetic Logic 
Units (ALU) while other cells may be optimised to car- 
ry out functions such as instruction decode or as 
processor registers. The number of different cells is 
only limited by the size of the array of cells. In practice 
the array will be divided into a number of discrete 
areas that are particularly efficient at implementing 
respective primary functions. ft-wHI be apparent that 
each of these cells has the capability to implement 
another function and usually a range of other func- 
tions according to other configurations. These addi- 
tional functions are control! d by the controller and 
sequencer whose rol is to nsure that the corr ct 
function is available when required. Primary func- 
tions may use general interconnect resources, but 
preferably they have their own dedicated resource for 
high -sp ed connections between primary functions 



of other cells. In this way the performance of th de- 
vice is not dependent on a general programmable in- 
terconn ct resource and by conn cting primary func- 
tions through resources with smaller parasitic loads, 

5 the device can operate faster. 

In order to safeguard data when changing be- 
tween configurations each cell has a latch controlled 
by a function control bit Transient current is reduced 
when switching between configurations by the provi- 

10 sion of a buffer in each cell, the buffer being control- 
lable as to is state during reconfiguration by a control 
line. 

It will be apparent that whilst this device has spe- 
cific application in the field of numerical engines such 
15 as DSP's, the primary functions can be chosen to suit 
other applications. Accordingly the techniques can be 
applied to any application. For example, another ap- 
plication is as a programmable communications de- 
vice. 

20 Another aspect of the invention also provides a 

method of configuring a configurable semi-conductor 
integrated circuit in which a sequence is programmed 
with data to facilitate selection of a required configur- 
ation from at least two possibilities. Usually each of 

25 a plurality of cells will have at least two configuration 
possibilities. Advantageously the configurations are 
programmable and the method further comprises in- 
putting and storing configuration data. A further ad- 
vantageous feature is the ability to program the se- 

30 quences to write over previously stored configuration 
data at a prescribed point in operation of the circuit 
An aspect of the invention provides a semi-conductor 
integrated circuit in which the circuit configuration is 
changed according to a pre-programmed sequence of 

35 configuration during operation of the device. 

The present invention will now be described by 
way of example only with reference to the accompa- 
nying drawings; in which:- 

Figure 1 is a schematic layout for re-configurable 

40 application specific device embodying the inven- 

tion; 

Figures 2 and 3 illustrate diagram mat ically the 
feature of the core architecture having different 
configurations and sequential access; 

45 Figure 4 illustrates diagrammatically the feature 

of the core having cells which are optimised to im- 
plement specific functions; 
Figure 5 illustrates diagrammatically a primary 
configuration for the device as a Digital Signal 

so Processor (DSP); 

Figure 6 illustrates diagrammatically a second- 
ary configuration for th device as a large multi- 
plier; 

Figure 7 illustrates schematically the layout of a 
55 cell including configuration memory means; 

Figure 8a illustrates diagrammatically the possi- 
ble arrang m nt of the cells in blocks with opti- 
mised functions; 
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Figure 8b illustrates schematically programma- 
ble local and global interconn ct resources for 
the cells; 

Figures 9a and 9b illustrate diagrammatically 
how the global interconnect resources are con- 5 
nected to the cell input and output multiplexers; 
Figures 9c and 9d illustrate diagrammatically an 
array of cell blocks and the arrangement of cells 
within a cell block; 

Figure 10 illustrates diagrammatically cell output 10 
state control; 

Figures 11, 12 and 13 illustrate diagrammatically 
three logic cell variants namely an Arithmetic 
Logic Unit function (ALU), an Accumulator func- 
tion (ACC), and a Decode cell function respec- 15 
tively; 

Figure 14 illustrates diagrammatically examples 
of different functions from the ALU and ACC opti- 
mised core cells; 

Figure 15 illustrates diagrammatically details of 20 
configurable Static Random Access Memory pro- 
visions; 

Figure 16 illustrates diagrammatically further de- 
tails of the cell configuration memory; 
Figure 1 7 illustrates diagrammatically instruction 25 
bus connections for DSP cells; 
Figure 18 illustrates diagrammatically a novel 
parallel carry select adder architecture which can 
be configured by the device; 

Figure 19 illustrates a cell configured to imple- 30 
ment a single stage carry select adder; 
Figure 20 illustrates a cell configured to imple- 
ment two carry select adders; 
Figure 21 illustrates an alternative cell configur- 
ation to implement a single stage carry select ad- 35 
der; and 

Figure 22 illustrates diagrammatically a DSP 
Timing Diagram. 

The present invention is described in the context 
of an integrated circuit intended for an application 40 
sp ctf ic device and will be described by way of exam- 
ple in the specific context of a Digital Signal Proces- 
sor (DSP). According to the invention the device is not 
restricted to a fixed architecture, but has the hard- 
ware re-configurable to allow the device (eg. DSP) to 45 
be optimised for each individual task. Thus at a macro 
level the device may be optimised for a new applica- 
tion for example MPEG, Polygon Engine, Blitter, DMA 
Engine, whilst at a micro level, the device can be opti- 
mised for each OPCODE, eg. MULTIPLE ALU, CUS- so 
TOM MULTIPLY. Thus a re-configurable application 
specific device (eg. DSP) allows many custom devic- 
es to be replaced with a single chip. Optimised OP- 
CODES increase p rformance. In effect the device 
can switch at clock speed between operating as a 55 
DSP, RISC or custom processor. 

Referring firstly to Figure 1 , her there is illustrat- 
ed a re-configurable application specific digital signal 



processor. Th chip includes an area 1 of core cells, 
Partitioned Static Random Access Memory (SRAM). 
3, a sequencer and controller 5 having control lines 
7, clocks 9 and clock lines 11, as well as programma- 
ble input/output 13 and associated data bus 15. Also 
shown is a signal Decompress decoder 17, a commu- 
nications link 19 and associated input/output and Ex- 
pansion porting 21, and address bus 23. 

There are a plurality of core cells 2 and these pro- 
vide for example (in the case of a DSP configuration), 
Instruction Decode, registers, programme counter 
and stack pointer facilities. Each core cell can be pro- 
grammed to perform a range of functions and certain 
core cells are optimised to implement specific func- 
tions. Thus, for example, reference to Figure 4 illus- 
trates optimisation of certain cells for ALU functions 
as at 2a, registers 2b, programme counter 2c, general 
counter 2d, instruction decode 2e and input/output 2f. 

One schematic configuration of core cell denoted 
by dotted outline is shown in Figure 7 and the core cell 
includes within it a logic cell 22 having selectable 
functions (for example four). Programmable core cell 
inputs (eight) (ie. electrically selectable interconnec- 
tions) are shown at 25 applied to two 4:1 input multi- 
plexers 26,28. The cell output is shown at 27. Exam- 
ples of Logic cell configurations are described further 
with reference to Figures 11, 12,13, and 14. Input mul- 
tiplexers are controlled by respective 2-4 Decoders 
30, 32. A further 2-4 Decoder 34, controls a 4-1 Mul- 
tiplexer in the logic cell 22 and an output multiplexer 
70 is controlled by a 2-4 Decoder 48. Direct pre-wired 
connections to the logic cell are indicated by numeral 
YA-YD. 

In the Figure 7 illustration the cell includes con- 
figurable memory provisions comprising conf igura- 
tion cache 36 and instruction cache 38, as well as so 
called "hard wired" or fixed configuration provisions 
40. For the DSP application the fixed configurations 
comprise a primary DSP Boot Configuration set by 3 
x 2 bit configuration elements 40a, and a secondary 
configuration eg. Multiplier configuration set by 3 x 2 
bit conf iguration elements 40b. It is intended that the 
primary (fixed) configuration will be implemented au- 
tomatically on boot-up of the device so as to give it its 
primary application specific function. 

The configuration cache 36 in the illustrated em- 
bodiment comprises four, 3 x 2 bit data stores, 36a-d 
which can be write enabled from an instruction up- 
date bus 44 and written with data from Data bus 46. 
The instruction cache 38 comprises 8 x 2 bit data 
stores which are write enabled from the Instruction 
update bus (44) and written with data from the data 
bus 46. The instruction cache 38 is read enabled from 
the Instruction sel ct bus 42. A 2-4 Decoder 48 en- 
abled from the instruction select bus 42 selects and 
read enables one of the four data stores 36a-d ac- 
cording to the data stor of the instruction cache se- 
lected. The output of Decoder 48 also facilitates the 
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direct configuration of the logic cell by controlling th 
4:1 output multiplexer 70. Also illustrated is a function 
control bit 50 and has connections from the read and 
write enable lines (42,44) and into the logic cell 22. 
The function control bit 50 controls latch 54 (see Fig- 
ure 10). 

Figure 16 illustrates, for the fixed configuration 
provisions (40) and the configuration cache 36, the 
read (42), write 44' and data 46' connections. Note 
both read and write provisions for the configuration 
cache 36 only. 

Reverting back to Figures 2 and 3, each of blocks 
2', 2" and 2"' represent configurations of the core 2. 
Large blocks of functionality are accessed as a series 
of configurations. Each new configuration receives 
data from the last using inter-process connections 52 
and cells 54 designated for latching critical data. 
Other cells 54 are designated to act as inputs or out- 
puts. Reconfiguration time can be of the order of 
10nsec. The core architecture is optimised to imple- 
ment each OPCODE. This allows the word size of 
ach arithmetic function to be adjusted to the required 
provision. Thus, referring to Figure 3, a first core con- 
figuration (OPCODE 1) executes a 1 6 bit multiply and 
cos function, a second core configuration (OPCODE 
2) carries out a 32 x 32 bit multiply function, and a 
third configuration (OPCODE 3) carries out a 64 bit 
ADD function. 

Reference is now made to Figure 10 which illus- 
trates the output state control as applicable to the like 
of the cell illustrated in Figure 7 and the correspond- 
ing cell components appropriately referenced are illu- 
strated with the exception of the instruction cache 38. 

As has been mentioned above certain cells are 
designated for latching critical data and hence the 
cells have a latch provision 54 with inputs from the 
function control bit 50 and a hold input line 56. These 
function to preserve the state of data from cells be- 
tween configurations. In addition a buffer 60 is pro- 
vided in order to reduce transient current when 
switching between configurations by setting its out- 
put state to a known condition. 

The cells interconnect resources are now descri- 
bed with reference to Figures 8a, 8b, 9a, and 9b. Fig- 
ures 8a and 8b show diagrammatically how cells 
might be arranged in regular blocks (B) (eg. rows and 
columns), with the blocks including ceils which are 
optimised for different functions. Thus Figure 8b 
shows columns of ACC cells, ALU cells and shift cells, 
and two rows of Decode cells. Columns of cells each 

have two global (Y) buses (Y1, Y2, Y3, Y4 YN1, 

YN) and the rows of cells each hav at least two glo- 
bal (X) buses (X1, X2 Xn-1, Xn). The Decode cells 

head up the columns of each block and have three X 
buses. Bus switches BS are provided in the Y buses 
between adjacent blocks. In addition there are hidden 
(or pre-wired direct connection) Y buses, YA-YD. 
These run from the decode ceils to all the cells in the 



column below. In addition local direct conn ction 
paths are preferred between cells. Thus, taking as an 
exampl cell SCinFigure8b.it has input conn ctions 
from outputs of an upper adjacent cell, a lower adja- 

5 cent cell, a right adjacent cell, a left adjacent cell, and 
a next left adjacent cell. These connections are des- 
ignated U, D, R, L, J. Not all cell variations will nec- 
essarily have all the local connections. The majority 
of these local connections are electrically selectable 

10 as to their conduction state, but most usually the left 
adjacent connection will be a pre-wired connection. 

Figure 9a illustrates, for one cell as for all core 
cells, how an input multiplexer 26 controls selection 
of inputs from X and Y buses and an output multiplex- 
es er 70 controls selection of outputs to the same X bus- 
es and next column of Y buses. 

The cells are arranged in 1 0 x 8 blocks and an ex- 
ample of such an array of cell blocks is illustrated in 
Figure 9c. Blocks 1 00 are formed in an 8 x 4 array and 

20 a programmable input/output 102, data buses and 
switches 104 and partitioned SRAM 106 are also 
shown. Each block 100 comprises an array of 10 x 8 
cells and conveniently, columns of cells within the 
block have a similar primary configuration. Forexam- 

25 pie, Figure 9d illustrates a block 100 having two col- 
umns of cells 100 a & b configured as multiplexer 
cells, columns 100 c as a product adder, 100d barrel 
shifter cells, 100 e arthimetic and logic cells, 1 00 f ac- 
cumulator cells and columns 100 g & h configured as 

30 multiplier expansion cells. The columns in each block 
are headed up by decode cells. 

Referring now to Figure 15, the configurable sta- 
tic random access memory (SRAM) 3 stores partition 
data passed to it from the sequencer and controller 5 

35 along partition data bus 72. The operation of the DSP 
requires the storing and retrieving of data and the pro- 
vision of the SRAM on the device ensures that access 
to the stored data is faster than if the SRAM was lo- 
cated externally. 

40 The sequencer and controller 5 controls the op- 

eration of buses 42, 44, 45 and 46. Hence, the se- 
Cquencer and controller 5 includes the control of the 
operation of selecting individual data stores of cells, 
sending data to the stores and controlling the se- 

45 quence of implementation of configuration data stor- 
ed within cell. The necessary control instructions for 
the sequencer and controller 5 is provided by an ex- 
ternal source of memory (not shown). In addition to 
the above operations, the controller 5 can select in- 

so dividual data stores not currently used such that they 
can be updated with new configurations from the ex- 
ternal memory. 

Figures 11, 12 and 13 illustrate respective ALU, 
ACC and Decode cell variants. Appropriate refer nc- 

55 es have been used as previously referred to. 

Figure 13 shows an example of a cell optimised 
for decode. Two decode c lis will head up the blocks 
of cells as shown in Figures 8a and 8b. The illustrated 
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variation is th one which has the pre-wired intercon- 
n ction YA, YB which feed down to each of the cells 
b low. The other d code will generate the YC, YD 
pre-wired interconnections. Thus the ALU type cells 
of Figure 11 have pre-wired connections YA, YB, s 
whilst the ACC type cells have pre-wired connections 
YA, YB, YC, YD. Note also that for the ALU and ACC 
variants the left adjacent connection L is pre-wired, 
and for the ALU cell the Cin, Cout is a pre-wired inter- 
connection running the length of the column of cells. 10 
Other X and Y buses are as described above. 

Control signals from the outputs of the decode 
and for inputs of the cell variants will be pre-wired for 
the optimised cell functions, ie. for any functions 
which are known to be needed for the specific appli- 1 5 
cation. 

Figure 14 illustrates some of the different func- 
tions which are available from the ACC and ALU core 
c lis of Figures 10 and 11 respectively. 

Figure 17 illustrates an alternative internal cell 20 
arrangement for the case of DSP cells (shown simpli- 
fied) with the cell input shown simply at 25 and cell 
output at 27. The memory comprises 8 x 3 bit data 
stores and a 3-8 Decoder 80 is provided such that one 
of the eight selectable options (eg. functions or inter- 25 
connect) contained in the logic cell can be selected. 
In order to update a particular data store within a par- 
ticular cell there is provided a memory select 45 
(omitted from the illustrations of the previously de- 
scribed cell arrangement) and hence the required cell 30 
can be selected arid the particular data store to be 
write enabled or read enabled is selected by the in- 
struction update bus (44) or instruction bus (42). Data 
is written to the data store from memory data bus (46) 
(not illustrated in Figure 17). 35 

A novel adder structure which can be configured 
by the device will now be described with reference to 
Figures 18 to 21 . A 16-bit adder is illustrated in Figure 
18 and indicated generally by numeral 60. The adder 
comprises a plurality of carry select adders 62 form- 40 
ing a first multi-bit adder block 64 and a second multi- 
bit adder block 66. The adder 60 sums two 16 bit 

words indicated as a1, a2, a3 a16 and b1, b2, 

b3 b16 in order to derive a sum indicated by s1, s2, 

s3 s16 and carry element 'Cout\ 45 

First multi-bit adder block 64 sums the eight least 
significant bits of each 16 bit word and for each bit 
there is an associated carry select adder 62. Each 
carry select adder comprises two inputs An, Bn 
(wherein 'n' is the number of the bit), output 68, carry so 
in 70, carry out 72 and a first and second 2:1 multi- 
plexer 74, 76. The first input to the first multiplexer 74 
is equal to the value of An + Bn assuming th carry 
in is '0' and the second input assumes that carry in to 
be T. The output Sn is selected by the carry in 70. 55 

The two inputs to the second multiplexer 76 are 
equal to the carry resulting from the sum of An and 
Bn with the carry in being equal to '0' and '1'. The car- 

6 



ry out 72 is selected by carry in 70. Obviously, th car- 
ry in to the first carry select adder will be qual to '0\ 
The second multi-bit adder block 66 sums the 
eight most significant bits of each 1 6 bit word and for 
each bit there are two associated carry select adders, 
78, 80. Each of the carry select adders 78, 80 is con- 
structed in a similar manner as described above. Car- 
ry select adders 78 sum the two eight bit words ie. a 9 . 

a^0 a^ and b 9 , biO ^6, assuming that the carry 

out from the first adder block 64 is T and carry select 
adders 80 assume that the carry out is , 0\ Therefore, 
for each bit two outputs are calculated and fed into an 
associated multiplexer 82. The output providing Sn is 
selected by the carry out from the first adder block 64. 

In operation, the first adder block calculates the 
addition of the eight least significant bits and produc- 
es a carry out value. Simultaneously, the second ad- 
der block calculates the two possible sums of the ad- 
dition of the most significant bits and the correct sum 
is selected by the carry out produced by adder block 
64. In consequence the time delay to calculate a 16 
bit addition is taken to be the delay in the addition of 
the first eight bits (8ADD) plus the delay in selecting 
the sum of the last eight bit ie. one multiplexer delay 
(MUX). 

For each additional eight bit adder block the time 
delay is equal to one multiplexer. For example, a thirty 
two bit adder would result in a propagation delay of 
8 ADD + 3 X MUX In consequence, the adder struc- 
ture described results in an improved speed of oper- 
ation compared to that of a conventional adder struc- 
ture. 

Figure 20 illustrates an alternative cell structure 
wherein the two carry select adder requiring two cells 
can be replaced by a single configured cell. 

Figure 21 illustrates a conventional circuit for sin- 
gle stage carry select adder which may be used as an 
alternative to the circuit of Figure 19. 

The operation of the device will now be described 
wherein initially, as described above, the configura- 
tion provisions 40 are 'hard wired* orf ixed with a DSP 
configuration 40a and a multipler configuration 40b. 

An external memory store (not shown) contains 
all the necessary configuration data in order to con- 
trol the controller and sequencer such that each of the 
data stores (36a-d, 38) in each cell can be program- 
med. In order to program a data store a typical proce- 
dure would be to firstly select the cell by memory se- 
lect 45, select the data store to be write enabled by in- 
struction update bus 44 and to write data to the se- 
lected store via data bus 46. 

Each of the four data stores of the configuration 
cache 36 contains sufficient configuration data to se- 
lect the input to the logic cell 22 and to also select one 
of the functions contained within the logic cell. 

The initial boot up operation of the device results 
in a configuration as p r either of the primary config- 
urations 40a, 40b according to the boot up instruc- 
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tion. Thus for xample the DSP or Multiplier configur- 
ation is stablished. 

However, if the device is required to implement 
another configuration eg. a divide function, then the 
controller and sequencer 5 selects and write enables 
the required data store of the configuration cache 36 
of each cell necessary to implement the configura- 
tion. The external memory supplies the necessary 
data as to which cell and data stores are to be select- 
ed in order to implement the required configuration. 

There is also the option for adopting other pro- 
grammed configurations from the configuration 
cache and for writing and substituting other configur- 
ations. 

Thus for the example given, the four configura- 
tions possible from the configuration cache may not 
be sufficient. Software programming can be used to 
implement another configuration. The programmer 
will be able to refer to the technical specifications for 
the device and determine how the desired func- 
tion/configuration can be implemented (for example 
many possible architecture changes will be listed, 
p rhaps in terms of a load instruction). Thus whilst 
load instructions 1-4 might represent the most typical 
configurations which are to be stored in the configur- 
ation cache, the programmer determines from the 
technical specification that load instruction 33 for ex- 
ample is required. Thus the programmer will have the 
instruction loaded into the configuration cache. There 
will be instances where more configurations are re- 
quired to process the incoming data then can be stor- 
ed in the cell memory for access at clock speed. How- 
ever, this difficulty can be overcome by re-program- 
ming a "redundant" configuration cache with the "ad- 
ditionaT configuration data in advance of its require- 
m nt, by including the re-configuration instruction in 
the software programme. The sequencer can control 
re-configuration at dock speed, whilst the data from 
th configuration is held safe in the latch cells. The 
four configurations (36a-36d) of the cache can be re- 
used in different combinations at different cell sites. 
This is facilitated by instruction cache (38) which can 
select different local cell configurations from a global 
instruction placed on instruction bus 42. 



Claims 

1. A configurable semi-conductor integrated circuit 
in which an area (1) thereof is formed with a plur- 
ality of cells (2) each having at least one function 
and inter-connections with at I ast som other 
said cells (2), characteris d in that at least some 
of the plurality of cells (2) have interconnections 
(25) which are electrically selectabl as to their 
conduction state, and at least some of the plural- 
ity of cells (2) have interconnections (YA-YD) 
which are pre-wired, each ceil has two or more 



possible configurations, each configuration be- 
ing defined by th cell function and/or its inter- 
connection with other cells according to cell con- 
figuration data, and furth r comprising means 
5 (36, 38, 40) storing configuration data for at least 

two cell configurations (per cell) and means (30, 
32, 34, 42, 48) to enable one of the possible cell 
configurations according to the cell configuration 
data selected. 

10 

2. A configurable semi-conductor integrated circuit 
as claimed in claim 1 in which means (36, 38, 40) 
storing at least two cell configurations are pres- 
ent in the cell. 

15 

3. An integrated circuit as claimed in claim 1 or 2 in 
which the means for selecting the required cell 
configuration comprises an instruction bus (42) 
communicating with the said configuration data 

20 store. 

4. An integrated circuit as claimed in claim 1 , 2 or 3 
in which at least one of the cell configurations is 
pre-wired (40a, 40b) to configure the integrated 

25 circuit with an application specific function when 

selected. 

5. An integrated circuit as claimed in daim 4 in 
which there are two pre-wired (40a, 40b) applica- 

30 tion specific functions. 

6. An integrated circuit as daimed in any one of the 
preceding daims in which there is at least one 
programmable cell configuration. 

35 

7. An integrated circuit as daimed in any one of the 
preceding daims in which there are both pre-wi- 
red and programmable cell configurations. 

40 8. An integrated circuit as daimed in daim 6 or 7 fur- 
ther comprising a write enable bus (44), and a 
data bus (46) communicating with the means (36, 
38) storing the cell configuration data for the pur- 
pose of rewriting data to the store for re-program- 

45 ming purposes. 

9. An integrated circuit as daimed in any one of the 
preceding daims further comprising means stor- 
ing a plurality of configuration selection instruc- 

so tions, an instructions select bus (42) communi- 

cating with said means and an output signal path 
for selecting the required configuration data store 
to be implemented or directly effecting cell con- 
figuration. 

55 

10. An integrated circuit as daimed in claim 9 further 
comprising an instruction write bus (44) and an 
instruction data bus (46) for writing to the instruc- 
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tion storing means (36, 38). 

1 1. An integrat d circuit as claimed in any on of the 
preceding claims in which means (54) is provided 
to preserve the output between configurations. 

12. An integrated circuity as claimed in claim 11 in 
which said means comprises a latch (54) wherein 
each cell incorporates a latch to preserve its out- 
put 

1 3. An integrated circuit as claimed in any one of the 
preceding claims in which the cells are optimised 
for a primary function. 

14. An integrated circuit as claimed in claim 13 com- 
prising cells which are optimised for different pri- 
mary functions. 

15. An integrated circuit as claimed in any one of the 
preceding claims including means (60) to reduce 
transient current when switching between config- 
urations. 

16. An integrated circuit as claimed in claim 15 in 
which said means comprises a controllable buffer 
(60) in the output line of each cell. 

17. An integrated circuit as claimed in any one of the 
preceding claims further comprising sequencer 
means (5) to control the availability and selection 
of the configuration. 



being equal to 'O' and '1' respectively and 
wherein the sum selection means selects the 
sum of the further multi-bit adder block according 
to the carry out calculated from the previous 
5 block. 

22. A method of configuring a configurable semi- 
conductor integrated circuit having a plurality of 
cells (2) with at least two configuration possibili- 

10 ties in which a sequencer (5) is programmed with 

data to facilitate selection of the required cell con- 
figuration. 

23. A method as claimed in claim 22 further compris- 
15 ing inputting and storing cell configuration data. 

24. A method as claimed in claim 22 or 23 further 
comprising programming the sequencer with 
data to write over previously stored configuration 

20 data at a prescribed point in operation of the cir- 

cuit 

25. A configurable semi-conductor integrated circuit 
characterised in that circuit configuration is 

25 changed according to a pre-programmed se- 

quence of configurations during operation of the 
device. 

26. An integrated circuit as claimed in claim 25 in 
30 which an area there is formed with a plurality of 

cells, each cell having two or more possible con- 
figurations, each configuration being defined by 
the cell function and for its interconnection with 
other cells according to conf iguration data. 



18. An integrated circuit as claimed in any one of the 
preceding claims comprising decode means (30, 35 
32, 34, 48) in each cell (2) to decode configura- 
tion state to control the configuration of each cell. 

19. An integrated circuit as claimed in any one of 
claims 4 or 5, 13 or 14 in which the configuration 40 
data store corresponding to the primary or appli- 
cation specific function of the cell is contained 
within the device in a non-volatile memory. 

20. An integrated circuit as claimed in any one of 45 
claims 13 or 14 in which the pre-wired (hidden) in- 
terconnect resources interconnect optimised 
cells for efficient implementation of the primary 
(application specific) functions. 

so 

21. A multi-bit adder for summing at least two multi- 
bit words comprising a first multi-bit adder block 
(64) for summing the least significant bits and at 
least one further multi-bit adder block (66) for 
summing the most significant bits and having 55 
sum s lection means wherein said further multi- 
bit adder block calculates the two possible sums 
resulting from a carry out from the previous block 
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